aboutsummaryrefslogtreecommitdiff
path: root/_slides/2020-10-19-rollout-feature-flag-experiment-operational-toggle.slides
blob: 22770e6f37364b363c2069ec5ece36b53a82fffa (plain) (blame)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
---

title: Rollout, feature flag, experiment, operational toggle

date: 2020-10-19

layout: slides

lang: en

ref: rollout-feature-flag-experiment-operational-toggle

article: _articles/2020-10-19-feature-flags-differences-between-backend-frontend-and-mobile.md

---

# Rollout, feature flag, experiment, operational toggle
Different use cases for **backend**, **frontend** and **mobile**

---

"Feature flags" tend to come up when talking about **continuous deployment**

???

I'm using "quotes" because I'm mixing up different meanings of "rollout"

---

# CI
continuous integration

# CD
continuous delivery

# CD
**continuous deployment**

???

Background: build vocabulary, why are feature flags related to CD

CI solves: manual integration of long-lived branches

CD solves: automation of deployment process

CD solves: releases as frequent as possible

That's where the "GoCD" name comes from

---

# Types:
1. rollout
2. feature flag
3. experiment
4. operational toggle

---

# rollout
## For *rolling out* a new version of software

**Short-lived** using **percentages**

- a [new deployment of k8s][k8s]
- new [APK released to the Play Store][apk]

[k8s]: https://kubernetes.io/docs/concepts/workloads/controllers/deployment/#creating-a-deployment
[apk]: https://support.google.com/googleplay/android-developer/answer/6346149?hl=en

???

Relevant as long as the new code is deployed

---

# feature flag
## For turning a feature *on* or *off*

**Medium-lived** using **allow list**, **A/B test**, **percentage**,
**app version**, *etc*.

- `:new-chargeback-flow`
- `:new-debit-card-activation-screen`

???

Relevant as long as the new code is being developed

---

# experiment
## For analyzing behaviour

**Medium-lived** using **allow list** and **A/B test**

- `:debit-withdrawal-test`

---

# operational toggle
## For disabling features in `#crash`-like situations

**Long-lived** using **percentage**

- `:bank-barcode-payment`
- `:savings-bank-barcode-query-provider`

???

Lives for as long as the code is in production.

It feels like a system-level circuit breaker.

---

We now know about the types

## But they have different relevance for **backend**, **frontend** and **mobile**

---

# backend

1. **rollout**: k8s blue/green, canary and ~`common-rollout`~ `common-xp`
2. **feature flag**: ~`common-rollout`~ `common-xp` and datasets
3. **experiment**: `common-xp`
4. **operational toggle**: ~`common-rollout`~ `common-xp`

???

This is a bit why common-rollout isn't called *common-feature-flag*: it was
initially designed with backend usage of mostly *rollouts* in mind, and just a
bit *feature flags*.

Avoid using configuration for doing operational toggles: it is less dynamic, so
it defeats the purpose.

---

# frontend

1. **rollout**: CDN and page refreshes
2. **feature flag**: percentages and maybe IPs (no `:customer/id` on the website)
3. **experiment**: via dynamic backend control
4. **operational toggle**: via dynamic backend control

---

# mobile

1. **rollout**: app stores
2. **feature flag**: via dynamic backend control
3. **experiment**: via dynamic backend control
4. **operational toggle**: via dynamic backend control

---

Key differentiator is
## How much **control** we have over the **environment**

---

## **backend**

# Full control
🎉

???

Can edit, update and even delete rollouts as desired.

Mix and match at will!

---

## **frontend**

# Partial control

When choose when to make a new version available

???

We can control when a new version is available, partially when someone will
upgrade it.

But it is easy to fallback to "reload the page and try again".

---

## **mobile**

# Very limited control

- app stores can restrict updates (worse for iOS)
- customers still have to download new versions

---

# Costs

- more complex code
- compatibility with old app versions
- nesting is exponential

---

# Benefits

- dynamicity

---

## Weighting costs × benefits

The less control we have, the more we value dynamicity

---

## Weighting costs × benefits

- backend: sometimes worth the cost
- frontend: almost always worth cost
- mobile: **always** worth cost

---

# Best practices

---

## Dynamic content > feature flag

Always true for **mobile**, almost always for **frontend**

---

## Use `:include-list` for named groups

Always true for **backend**, **frontend** and **mobile**

{% raw %}
```clojure [2-3]
{:rules
 #{{:type    :include-list
    :content {:filename "debit-team-members.txt"}}}}
```
{% endraw %}

---

## Always use `:app-version`

only for **mobile**

{% raw %}
```clojure [2]
{:rules
 #{{:type    :app-version
    :content {:min-version #{{:platform :android
                              :code     1000000}
                             {:platform :ios
                              :code     2000000}}}}}}
```
{% endraw %}

---

## Extend ~`common-rollout`~ `common-xp` if required

That's how `:include-list`, `:app-version`, *etc.* were born

---

## Beware of many nested feature flags

True for **backend**, **frontend** and **mobile**

???

Exponential growth of combinations

---

## Don't delete app-facing feature flags

True for **mobile**

???

This could break old app versions, only do this intentionally

We don't have (yet) a strategy for dealing with LTS of the app, and we just say:
"we'll support every app version out there".

---

## Include a feature flag on the whiteboarding phase

---

## Include deleting/retiring the feature flag at the end

---

## Avoid renaming a feature flag

Use `:app-version` with `:min-version` instead

---

# And most importantly...

---

# ***Always*** rely on a feature flag on the app

Never do a hot fix, avoid expedited releases at all costs

???

The app is where we have less control, so the feature flag is how we get some of
that control back.

This doesn't mean you'll need 1 feature flag per PR

There's not such thing as:
"This is such a small thing, it doesn't need a feature flag"

You should ask yourself:
"It this crashes the app, am I OK with waiting for the next release train?"

---

## Thank you!

References:

1. "[Feature Toggles (aka Feature Flags)](https://martinfowler.com/articles/feature-toggles.html)", by Pete Hodgson
1. "[Continuous integration vs. continuous delivery vs. continuous deployment](https://www.atlassian.com/continuous-delivery/principles/continuous-integration-vs-delivery-vs-deployment)", by Sten Pittet
1. [Accelerate](https://itrevolution.com/book/accelerate/), by N. Forsgren, J. Humble and G. Kim